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METHODS AND SYSTEMS FOR INTERACTIVE INVESTIGATION OF 

GEOPHYSICAL DATA 

5 BACKGROUND OF THE INVENTION 

[0001] This application relates generally to analysis of geophysical data. More 

specifically, this application relates to methods and systems that use cross-correlation 
techniques in the analysis of geophysical data. 

10 

[0002] There are a number of applications in which it is desirable to identify 

geological structures. For example, in many instances, the presence of subsurface geological 
structures such as faults or other strati graphic features may be used in identifying locations of 
hydrocarbon reservoirs so that drilling sites may be specified. Typically, the identification of 

15 geological structures begins with the collection of geophysical data, which may be in the 
form of seismic or other types of geophysical data. For example, seismic data may be 
collected by distributing acoustic sources at an area and using the sources to synthesize 
physical shock waves that reflect off rock strata at variable velocities and return to the 
surface. Geophones at the surface measure and record ground motion to define a seismic 

20 response in the form of a data set. 

[0003] The collected data are subsequently analyzed, with the reliability of 

interpretations made from the data being very much tied to the manner of analysis. There 
have accordingly been a significant number of attempts to develop reliable analytical 
techniques for geophysical data that have good predictive power. Many of these approaches 

25 are based on pattern-recognition techniques and may incorporate various artificial- 
intelligence techniques such as neural nets, experts systems, genetic algorithms, and the like. 
A common feature of these techniques is that they attempt to identify some attribute that may 
be extracted from the data and to which the pattern-recognition approaches may be applied. 
These techniques accordingly require a relatively high level of user input in their execution 

30 and, in practice, their reliability has tended to be disappointing. A basic deficiency of all 

such techniques is their dependence on identifying an attribute that may be extracted from the 
data that provides a parameter set having suitably predictive power; the ability to do so has 



been constrained by the variation and complexity of data that arise from diverse real-world 
applications. 

[0004] There is accordingly a general need in the art for improved methods and 

systems for analyzing geophysical data. 

5 

BRIEF SUMMARY OF THE INVENTION 

[0005] Embodiments of the invention make use of a technique that calculates cross- 

correlations based on underlying data without attempting to identify specific attributes 

10 indicative of certain features. A method is thus provided for investigating an image set of 
geophysical data distributed over a first //-dimensional volume where TV> 2. The method 
makes use of a known feature of interest within a target set of geophysical data distributed 
over a second //-dimensional volume. A subvolume that encloses the known feature of 
interest is selected. At each of a plurality of positions within the first TV-dimensional volume 

15 a cross-correlation is calculated between the data distributed within the subvolume and 
corresponding data distributed in the first TV-dimensional volume about the position. 

[0006] In some instances, a distribution of results of the cross-correlation calculation 

may be displayed for a user. The subvolume may be modified in accordance with a user 
instruction, with the cross-correlation calculations being repeated at each of a plurality of 

20 positions within the first TV-dimensional volume using the modified subvolume. The 

subvolume may be modified by changing a size of the subvolume a by changing a shape of 
the subvolume. In some embodiments, each of the image set and the target set consists of 
real data. The cross-correlation may be calculated from such real data as a real cross- 
correlation. Alternatively, the real data of the image and target sets may respectively be 

25 transformed to complex data by performing a Hilbert transform of the real data, with the 
cross-correlation being calculated as a complex cross-correlation. Such a complex cross- 
correlation permits calculation of an amplitude envelope and phase distribution over the first 
TV-dimensional volume. In some cases, the data distributed within the subvolume may be 
normalized over the subvolume prior to calculating the cross-correlation. 

30 [0007] The image and target sets may comprise seismic data or may comprise 

mathematical transforms of seismic data in different embodiments. In one embodiment, the 
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image set is the target set. Also, the subvolume may take a variety of different shapes in 
different embodiments, including the shape of a parallelepiped in one embodiment. 

[0008] The methods of the invention may be embodied in a computer-readable 

storage medium having a computer-readable program for directing operation of a computer 
5 system. The computer system may include an input device, a processor, and a display device. 
The computer-readable program includes instructions for operating the computer system to 
investigate geophysical data in accordance with the embodiments described above. 



BRIEF DESCRIPTION OF THE DRAWINGS 

10 

[0009] A further understanding of the nature and advantages of the present invention 

may be realized by reference to the remaining portions of the specification and the drawings 
wherein like reference numerals are used throughout the several drawings to refer to similar 
components. 

15 

[0010] Figs. 1 A - 1C provide schematic diagrams of data sets used in illustrating 

embodiments of the invention conceptually; 

[0011] Fig. 2 provides a flow diagram summarizing methods of the invention in 

various embodiments; and 

20 [0012] Fig. 3 provides a schematic illustration of a computer system on which 

methods of the invention may be embodied. 



DETAILED DESCRIPTION OF THE INVENTION 



25 [0013] Embodiments of the invention derive from a recognition by the inventors that 

many of the prior- art analytical techniques used in the evaluation of geophysical data rely on 
attempts to identify statistical reorganizations of underlying data that have good predictive 
value. Rather than engage in the difficulties that are necessarily associated with making such 
identifications, embodiments of the invention instead avoid the entire question of which 

30 statistical reorganization to choose by relying on the underlying data itself. Essentially, 

embodiments of the invention use a mapping of cross-correlation values to identify regions 
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within a volume of data that are similar to data known to include a feature of interest. This 
method imposes no predetermined expectation that any specific statistical reorganization of 
the data is more or less likely to identify features of interest. Instead, the method simply 
looks for regions of data that have similar characteristics to what is desired, without ever 
5 needing to specify what those characteristics are. In this way, multiple characteristics or 
attributes are accommodated automatically. Fundamentally, everything that is known about 
the data is embodied by the basic data itself; it is tautological that no particular statistical 
reorganization can ever provide more information than is inherent in the underlying data 
itself. Advantageously, the methods of the invention may greatly simplify data analysis by 
10 having an operator merely examine a display to identify high- amplitude events. 

[0014] An overview of methods for investigating geophysical data is shown 

schematically in Figs. 1 A - 1C. As used herein, the term "geophysical data" is intended to 
refer to any data derived from measurements of the Earth. In some instances, such 
geophysical data may comprise seismic data, which are derived from acoustic signals 

15 propagated through the Earth, but may alternatively encompass magnetic data, 

electromagnetic data, gravimetric data, and the like. Furthermore, the geophysical data may 
comprise raw field recordings or may comprise data that have been preprocessed using 
known techniques such as stacking, migration, offsetting, and the like. For instance, the 
geophysical data might comprise prestack data, limited-offset data, migrated poststack data, 

20 and the like. Also, the geophysical data may comprise a measured quantity such as amplitude 
or a derived quantity such as instantaneous phase or frequency. 

[0015] As shown schematically in Fig. 1, methods in embodiments of the invention 

make use of a target set of geophysical data 102 and an image set of geophysical data 104. 
The image set 104 is to be investigated for potential characteristics corresponding to a feature 

25 of interest known to be contained in the target set 102. For illustrative purposes, the feature 
of interest is denoted by reference numeral 106, although it will be appreciated that the 
feature of interest may not be readily discernible visually from the data within the target set 
102. In particular, identification of the feature of interest and its location within the region 
encompassed by the target set 102 will more usually have been the result of an extensive 

30 analysis. Such analysis could include studies performed by a geophysicist or geologist, and 
may include physical studies of the region such as by drilling in the location of the feature 
and confirming the presence of a hydrocarbon deposit. 
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[0016] In many instances, the target dataset comprises directly measured data or data 

derived from direct measurements, but more generally the target dataset may alternatively 
comprise model data. For example, in one embodiment, the target dataset may comprise 
synthetic data created using techniques such as wave-equation modeling based an a 
5 geophysicist's ideas about what may be in the subsurface or modeling a seismic response 
based on well data from adjacent regions. Also, a particular example of processed data that 
may be comprised by the target and image data sets is prestack data. For instance, in a 
specific embodiment, model prestack data are created containing likely seismic anomalies 
and the method described below is run on sets of prestack seismic gathers comprised by the 
10 image dataset. 

[0017] It is not necessary that the entire target data set 102 be understood as 

thoroughly as the portion that contains the feature of interest, although in some instances the 
entire physical region corresponding to the target data set 102 may have been studied 
meticulously. For example, in one embodiment, the image data set 104 and the target data set 

15 102 are the same data sets, with methods of the invention being used to investigate the whole 
data set to identify features similar to the known feature of interest 106. In other 
embodiments, the image data set 104 and the target data set 102 may be different data sets 
corresponding to data collected from different physical regions. In such embodiments, 
knowledge of the existence of the known feature of interest 102 in one physical location is 

20 used to investigate a different physical location for the presence of similar features. 

[0018] While the example shown in Figs. 1 A - 1C illustrates an embodiment of the 

method for three-dimensional image and target data sets 104 and 102, the methods of the 
invention may be applied more generally to data sets in two or more dimensions. 
Furthermore, the dimensions need not necessarily correspond exclusively to physical 

25 dimensions, although in some embodiments all the dimensions are physical dimensions. For 
example, data sets in three dimensions may define physical volumes over which data were 
collected when all three dimensions are physical dimensions; alternatively, data sets in three 
dimensions might instead define planes over which data were collected at different times 
when two dimensions are physical dimensions and the third dimension is a time dimension. 

30 Similarly, data sets in four dimensions may define physical volumes over which data were 
collected at different times when three of the dimensions are physical dimensions and one of 
the dimensions is a time dimension. The use of such time information may be useful, for 
example, in applications where temporal changes in characteristics are relevant. For 
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example, a feature in a target data set 102 spanning a time dimension may include 
information on how a geophysical characteristic has changed over time, allowing the image 
data set 104 to be investigated for geophysical characteristics that show the some type of 
temporal evolution. 

5 [0019] As shown in Fig. IB, an operator may select a subvolume 108 of the target 

data set 102 that encloses the known feature of interest 106. As explained in further detail 
below, in some embodiments this subvolume may undergo modification during the method as 
part of an interactive component with the operator. There are a variety of ways in which the 
selection of the subvolume 108 may occur. In one convenient embodiment, such a selection 

10 is made by providing a graphical display to the operator, with which the operator may select 
the subvolume 108 by defining a trace. For instance, in the example where the target data set 
102 is a three-dimensional data set, it may be displayed as a three-dimensional volumetric 
display so that the subvolume 108 may be identified by the operator using a mouse to trace 
out the subvolume. In other embodiments, other mechanisms may be used to identify 

15 subvolumes, such as by having the operator key coordinate information, use voice commands 
to input coordinate information, and the like. 

[0020] The target data within the subvolume 108 are then used to calculate a cross- 

correlation with data in the image data set 104. As shown in Fig. 1C, this is typically done 
around a plurality of positions 112 within the image data set 104 so that cross-correlation 

20 values over a region within the image data set 104 may be determined. The cross-correlation 
values provide an indication of how similar a corresponding subvolume within the image data 
set 104 is to the selected subvolume 108 from the target data set 102. As described further 
below, it may be desirable at different stages in the method only to calculate cross-correlation 
values for a subset of the image data set 104. For example, Fig. 1C illustrates an example in 

25 which the plurality of positions 112 around which the cross-correlations are performed is 
reached with trace 110 that maps out a plane within the image data set 104. At other stages 
or in other embodiments, a trace may be used that maps through the entire image data set 
104. 

[0021] While the example of Figs. 1 A - 1C shows the image data set 104, the target 

30 data set 102, and the subvolume 108 as rectangular parallelepipeds, there is no requirement 
that they take on these shapes. While the use of parallelepipeds is convenient for certain 
types of geophysical data acquisition, the method may more generally accommodate any 
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shapes, including ellipsoids, spheres, and the like in three dimensions, or their counterparts in 
other dimensions. 



the invention, a more detailed description is now made in connection with the flow diagram 
5 provided in Fig. 2. While this flow diagram sets forth a certain sequence of operations, it 
should be understood that neither this sequence nor the specific operations shown is 
necessarily required. In alternative embodiments, some of the operations may be omitted, 
other operations may be performed, and/or some of the illustrated operations may be 
performed simultaneously or in a different order than that shown. Furthermore, while the 
10 equations set forth in the description of Fig. 2 provide examples where the target data set 102, 
image data set 104, and sub volume 108 are three-dimensional, their counterparts for 
applications in other dimensions will be evident to those of skill in the art. 

[0023] At block 204, the target data set 102 is received and is denoted t(x,y,z) over 

coordinates (x,y,z). Similarly, at block 208, the image data set 104 is received and is 

1 5 denoted / (x, y 9 z). The target and image data sets may consist of real data, where "real data" 
refers to data having no imaginary component. While in some embodiments, the calculation 
of the cross-correlation may be performed directly with such real data, more useful 
information may be obtained by transforming such real data to complex data by performing a 
Hilbert transform. The use of complex data advantageously provides a full measure of 

20 waveform relationships and, as such, permits subsequent calculation of a complex cross- 
correlation to allow the extraction of both an amplitude envelope and phase. The inventors 
have found that this allows more clear identification of features in the image data set 104 that 
match the known feature of interest 106 in the target data set 102. Thus, at block 212 a 
Hilbert transform is calculated of the real target data to form a complex target data set, viz., 



[0022] 



While the sequence in Figs. 1 A - 1C provides an overview of embodiments of 



25 



t(x, y, z) <- 1 (x, y, z) + iH^ [t(x 9 y, z)] , 



where TC t denotes the usual Hilbert transform 




Similarly, at block 216, a Hilbert transform is calculated of the real image data to form a 
complex image data set, Wz., 
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f(x, y, z) <- f(x, y, z) + iU, [f(x 9 y, z)] . 

[0024] The subvolume that is used for initial calculations of the cross-correlation is 

selected by an operator so that an instruction identifying that subvolume is received at block 
220. Any of the methods previously mentioned may be used by the operator to identify the 
5 subvolume, such as by providing a trace to identify a region that encloses the feature of 
interest 106. In some instances, software may assist in the identification so that the trace 
provided by the operator is translated into a standardized subvolume shape. In some 
embodiments, the identified subvolume is normalized as indicated at block 224: 

t (x,y,z)= j 

10 where T is the average value of the target data t within the identified subvolume. There are a 
number of benefits provided by such normalization in embodiments where it is used. For 
instance, in cases where the subvolume remains un-normalized, results of the subsequent 
cross-correlation calculations may be dependent on the size of the feature of interest 106. In 
such cases, the identification of similar features in the image data set 104 maybe complicated 

15 by spikes or local amplitude changes in the cross-correlation calculations. There are, 

moreover, instances where the identification of similar features may fail spuriously if no 
normalization is performed, such as in certain cases where the image energy varies with 
position. 

[0025] With the subvolume identified, the method may proceed by calculating cross- 

20 correlations about each of a plurality of positions within the image data set. In some 
embodiments, there may be an interactive process in which the operator will initiate 
modifications to the subvolume based on results of the cross-correlation calculations, such as 
by changing the size and/or shape of the subvolume. Depending on the specific 
characteristics of the target and image data sets, and the possible features within them, the 
25 size of the subvolume that provides the most useful information may vary. Small 

subvolumes will tend to produce larger numbers of matching features while large subvolumes 
will tend to provide smaller numbers of matching features. This may be illustrated by 
considering the limiting cases where the subvolume is a point and where the subvolume is 
equal to the entire data set. When the subvolume is a point, the cross-correlation calculations 
30 will simply remap the image data set producing an identity; when the subvolume is equal to 



the entire data set, the cross-correlation will simply produce a spike at the origin. In either of 
these limiting cases, the cross-correlation calculations provide no additional information, 
illustrating the desirability to have an intermediate subvolume size. 

[0026] Accordingly, it may be more efficient in some embodiments for initial cross- 

5 correlations to be performed only over a specified subset of the image data set 104 until the 
operator has determined the desired size for the subvolume. For example, in some 
embodiments, a plane within the image data set 1 04 may be selected, corresponding to a 
crossline, inline, or timeslice section, or a planar slice of any orientation within the image 
dataset. At block 228, an initial point (w, v, w) is selected within the specified subset of the 
10 image data set. The cross-correlation is then calculated about the selected point at block 232, 
with the cross-correlation being a complex cross-correlation in embodiments where the 
Hilbert transforms have been calculated: 

F(u 9 v, w) = ^ = . 

This process is repeated as indicated at blocks 236 and 240 until the cross-correlation value 
1 5 has been calculated at each of the plurality of positions identified within the specified subset. 
At that point an amplitude envelope 

A = yl(ReF) 2 +(TmF) 2 

may be calculated from the cross-correlation at block 248 and an instantaneous phase 

ImF 

cp = tan 

ReF 

20 calculated at block 244. The phase may be unwrapped at block 252 and the amplitude and 
phase results displayed to the user at block 256. 

[0027] There are a variety of ways in which the amplitude and phase results may be 

displayed, including both in textual and graphical form. A graphical format may 
advantageously use color coding to render contrasting amplitude values that indicate features 
25 in common between the subvolume of the target data set 102 and the image data set 104. The 
operator may consider the display and input a modification of the subvolume, as checked at 
block 260, to prompt a recalculation of the results with the modified subvolume. Usually, 
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such a modification will consist of changing the size of the subvolume without changing its 
shape, but in other embodiments shape changes may also be introduced. In this way, operator 
interactivity may be used to converge towards a subvolume size and/or shape that is 
especially suitable for a given investigation. The fact that smaller subvolumes produce larger 
5 numbers of matching features may be exploited during such interactivity. As the subvolume 
size is increased, the number of highly contrasting amplitude values in the graphical display 
is reduced, effectively causing matching features to "pop out" in the display. The evaluation 
required by an operator is thus greatly simplified since the method inherently allows the 
operator merely to examine the display for high-amplitude events. It is generally expected 
10 that experienced operators will be able to rely on past investigations to make efficient 

assessments of the data and thereby quickly determine the desired size of the subvolume. 

[0028] Once the operator has decided on a particular subvolume size, the cross- 

correlation calculations may be performed for the entire image data set 104 in a non- 
interactive mode at block 264. The final results may then be rendered in a format suitable for 
15 multidimensional analysis, such as through use of three-dimensional viewing systems for 
three-dimensional data sets. The rendering of the final results is suitable for study by a 
geophysicist to identify high-amplitude values within the image data set 104 that indicate the 
presence of features similar to the known feature of interest. 

[0029] While the above discussion has focused on an interactive process that uses 

20 only a subset of the image data set 1 04, it is of course possible for such interactivity to be 
performed on cross-correlation calculations for the entire image data set 104. This is 
particularly true in embodiments that provide fast processing and sophisticated 
multidimensional rendering tools. Furthermore, even in embodiments where a subset is used 
as described, still further calculations might still sometimes be performed after analysis of the 
25 results produced at block 264. For example, while the interactive analysis on a subset may 
provide a good determination of a desired subvolume size, the later analysis on the full set of 
results may suggest a further modification of the subvolume size to enhance or reduce certain 
characteristics identified in the results. 

[0030] The methods of Fig. 2 may be performed with any suitably programmed 

30 computational device, an exemplary structure of which is provided in Fig. 3. This figure 

broadly illustrates how individual system elements of such a computational device 300 may 
be implemented in a separated or more integrated manner. The computational device 300 is 

10 



shown comprised of hardware elements that are electrically coupled via bus 326, including a 
processor 302, an input device 304, a display or other output device 306, a storage device 
308, a computer-readable storage media reader 310a, a communications system 314, a 
processing acceleration unit 316 such as a DSP or special-purpose processor, and a memory 
318. The computer-readable storage media reader 3 10a is further connected to a computer- 
readable storage medium 310b, the combination comprehensively representing remote, local, 
fixed, and/or removable storage devices plus storage media for temporarily and/or more 
permanently containing computer-readable information. The communications system 314 
may comprise a wired, wireless, modem, and/or other type of interfacing connection. 

[0031] The computational device 300 also comprises software elements, shown as 

being currently located within working memory 320, including an operating system 324 and 
other code 322, such as a program designed to implement methods of the invention. It will 
be apparent to those skilled in the art that substantial variations may be made in accordance 
with specific requirements. For example, customized hardware might also be used and/or 
particular elements might be implemented in hardware, software (including portable 
software, such as applets), or both. Further, connection to other computing devices such as 
network input/output devices may be employed. 

[0032] While the above is a complete description of specific embodiments of the 

invention, it should not be taken as limiting the scope of the invention, which is defined in the 
following claims. 
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